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(57) Abstract: The present invention 
relates to a method and apparatus for 
authenticating various types of digital 
certificates by using a text embedding 
method. The execution of the present 
invention is divided into two cases, one 
case including images in the certificate 
and another case not including the 
images in the certificate, according 
to the characteristic of the certificate- 
In the former case, a text embedding 
method is applied to images included 
in the certificate to embed a series of 
information text (a specific number of the 
certificate, and issuing organization, name 
of a person in charge, etc.) designated 
by a person who issues the certificate into 
the images. In the latter case, there is 
generated an image imperceptible to the 
human eye but having the same color as 
the ground color of certificate. 
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METHOD FOR EMBEDDING AND EXTRACTING TEXT 
INTO/FROM ELECTRONIC DOCUMENTS 

5 BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to a method and apparatus for preventing a 
10 certificate from being forged and altered and authenticating the certificate as the original 
copy based on the feature of the digital certificate. More specifically, the present 
invention is to issue the certificate after embedding invisibly a text containing a series of 
information to authenticate the certificate into the digital certificate, and authenticate and 
confirm the digital certificate by extracting the information text embedded through the 
15 method and apparatus suggested in the present invention and confirming the content of the 
extracted information text. 

Description of the Related Art 

20 

In advance, "embedding" means that not only a first object is physically arranged 
in a second object but also the content of the second object using a specific algorithm 
based on the content of the first object is changed so that the content of the first object can 
be extracted with the algorithm later used and the second object. In particular, in case 
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where the digital certificate includes an image format (logo or seal image representing an 
issuing authority, etc.), an information text is embedded into the image and in case where 
the digital certificate does not include an image format, a transparent image is generated to 
embed an information text. The information text embedded image is inserted in the 
certificate. Absolutely, there is no change to the naked eye. In other words, the 
embedded information text is embedded into the image of the certificate or the generated 
transparent image based on the content which is included in the certificate so as to make an 
visible confirmation impossible and is employed for authenticating the certificate after- the 
certificate has been issued. The present invention relates to a method and apparatus for 
embedding the information text into the image which is included in the certificate. In 
case where an exterior operation is applied to the embedded text, e.g., the digital data 
processing such as compression, filtering, re-sampling, cropping and the like is processed, 
it is easily removed and the authentication of the certificate is decided based on whether 
said information text exists or not. In case where a third party modifies the content of the 
information text on purpose, the embedded information text is extracted to find out the 
changed situation by deciding whether the extraction is possible or not and comparing with 
content of the first embedded text and deciding whether to be identical or not 

At present, as Internet users have sharply increased, trade of products through an 
electronic commerce has activated. In the trade of products or online/offline transaction, 
authentication of the digital certificate has been emerging as an important procedure. It is 
a tendency that process of issuing a certificate including not only such electronic 
commerce certificates but also various certificates related to civil afFair and certificates 
issued from each school, etc. has been computerized and electronized. However, the 
reason why it is impossible to issue a certificate as a digital document at present is a 
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forgery and alternation thereof. 

The technique for authenticating the digital certificate includes an authentication 
function application, blank controlling method, bar code method, etc. In the method of 
applying an authentication function, suppose that both certificate issuer and holder share a 
5 common secrete value S. The authentication function includes a secrete key algorithm, 
message digest and the like wherein the representative function is ^ hash function. 
Authentication using the hash function is proceeded as follows: First, an issuer of 
certificate of authentication calculates the hash value by the certificate, secrete key value S, 
and hash function and adds the hash value result to the certificate and sends the certificate 

10 to a person who requests to issue a certificate. A person who receives a certificate finds 
the hash value by the session key S that the person already knows and the hash function. 
The above hash value is compared with the hash value which is sent from the issuer of 
certificate of authentication. If the two hash values are the same, its authentication is 
confirmed and if not, the certificate of authentication is regarded as being altered. 

15 Comparison of the hash value enables to confirm identity (authentication) of the other 
party of the transaction and detect whether the certificate is forged or altered. . 

Such authentication using the authentication function has the following 
drawbacks: 

First, authentication is based on a text document. If a document of different 
20 format such as an image or voice mark is embedded into the certificate, it should be 
separately authenticated or its authentication is impossible. 

Second, it can accurately determine if the document data has been modified. 
However, if modified, it is impossible to restore the original document. 

Third, since a signature should be added so as to confirm integrity of the 
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certificate of authentication,, the size of the original certificate has been enlarged. 

The authentication method using a blank controlling method or letter-form 
controlling method is based on ASCII code. The ASCII includes 94 picture characters 
and 34 various characters which are only used for controlling but not printed out. The 

5 picture characters include 26 capital characters, 26 small characters, 10 numbers and 32 
special characters (%, *, $, etc.). The 34 controlling characters are employed for 
controlling the destination of data as the standard already designated and arranging texts to 
be printed out. The controlling character is classified as the following three types of v 
characters, i.e., format effector, information separator, and communication-control 

10 character. The format effector controls the layout of print-out and includes familiar 
controls for a type such as backspace (BS), horizontal tabulation (HT), carriage return 
(CR). The information separator is used for dividing data into a paragraph or page and 
includes record separator (RS), file separator (FS), etc. The communication-control 
character is used for preparing format in which a text message is conveyed through a 

15 phone line such as STX (start of text), ETX (End of Text), etc. Most of computers deal 
with a unit of byte as amount of 8 bits. Hence, ASCII character is mostly stored one by 
one each byte. The remaining one bit is used for a different purpose if necessary. For 
example, a printer sets up the highest effective bit as 0 and separates ASCII as 8 bit The 
blank controlling method or letter-form controlling method, using such controlling 

20 character, controls the blank of the text document of the original certificate to express the 
information to be embedded or changes the letter- form to express the embedding 
information. In case where the user's information is embedded on the object of the above 
code of the document, the drawbacks are as follows: 

First, the text document itself of the substantial certificate is changed. The 

4 
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change itself is not shown in the document Hence, in case the document is again restored 
after having been changed, a restoration signal is definitely damaged. , That is, it is 
impossible to perfectly restore the text document. 

Second, the region wherein a user can embed the information is small. It can not 
5 but embed the information that a user wants in a restricted manner. 

Authentication using the bar code method is at present actively applied. The bar 
code is a code system expressing the information with a pattern of arrangement of black 
bar and white bar having various widths. . Use of bar code makes it possible to deal with 
office work in an effective, fast, and accurate manner. As its effective value has been 
10 admitted in the nation, it is used in various fields of affairs such as office automation, 
factory automation, etc. around the distribution industry. At present, various ID card 
employs the bar code.. For a student card, the bar code employed therein has a card 
recognized in a card recognizer when students come in and out the library and a 
recognition apparatus recognizes the bar code, thereby confirming authenticity of ID cards 
15 and identity information. However, such authentication using the bar code has the 
following drawbacks: 

First, the bar code is a code system expressing the information with a pattern of 
arrangement of black bar and white bar having various widths, thereby making it possible 
to discriminate it with the eye and thus its alternation is possible. 
20 Second, even though the bar code is scanned many times, failure to read the bar 

code leads to inputting numbers using a keyboard. 

Third, if the bar code is uncertain or is damaged during the distribution, a scanner 
may read it to another numbers. 
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SUMMARY OF THE INVENTION 

The techniques in the prior art have various problems and drawbacks in 
authenticating the certificate as mentioned above. Such problems lead to the forgery or 
5 alternation of the certificate and thus the certificate can be misused on an illegal purpose. 
Therefore, it is an object of the present invention to provide a method and apparatus for 
embedding and extracting a text containing some information capable of providing 
authentication of the certificate using a method different from the existing method in order 
to firmly provide authentication of the digital certificate (identity certificate of 

10 authentication, deposit certificate of confirmation, trade certificate, graduation certificate, 
degree certificate, cash deal certificate of authentication, commodity receipt, and credit 
card certificate of authentication, etc.) that is traded online/offline. 

It is another object of the present invention is to provide the necessary information 
hidding. When a transmitter is sending the important information to a receiver, it can be 

15 used as a method and apparatus wherein the information text to be sent is embedded to the 
image-formed data or, the text" is transmitted after being manufactured as the image form 
and the receiver can restore the original information accurately. Such process makes it 
possible to transmit the information without occurring any warning on a network detector 
which are generalized over networks or other information detecting mechanism. The 

20 information can be securely ensured without encryption process between transmitter and 
receiver. 

The method provided in the present invention for acquiring the above object is to 
analyze the feature of the existing certificate and authenticate the certificate by applying an 
information text embedding method appropriate to the feature. The text embedding 



WO 02/37309 



PCT/KR01/01862 



method for authenticating the digital certificate according to the present invention 
comprises the steps of classifying the certificate by the certificate analysis, generating the 
information text according to the classified certificate, bit-streaming the generated text, 
converting the image data which is included in the certificate into the one dimensional data, 
5 and embedding said bit-streamed text data in said one dimensional image data. The step 
of extractinjg the embedded text comprises the steps of selecting the text embedded image, 
extracting the embedded bit-streamed data, and reading the embedded original text 
information from the extracted bit-stream data. The step of authenticating the certificate 
includes an apparatus and software for issuing the information text embedded certificate 

10 and an apparatus and software for authenticating the text embedded certificate. The 
apparatus according to the present invention, i.e., a computer, receives commands 
corresponding to said steps from the recording . medium which stores programs 
corresponding to said steps to perform embedment and extraction of it 

In general, the certificate being issued has two features as follows: 

15 First, an image-formed picture, registered seal or company logo, etc. representing 

the legal entity who issues the certificate or copyright holder, or confirming the issuer is 
embedded in the certificate being, issued. Such image embedding enables to promote the 
public trust of the certificate, digitalize the individual information which is included in the 
certificate and certificate form, etc. and issue the public document to a remote place by the 

20 computerization of the issuing procedure. 

Second, the certificate being issued uses frequently a specially designated paper in 
order to approve the issuer or protect the right of the copyright holder or represent the 
peculiarity of an issuing authority. The special papers include frequently an image 
representing the issuing authority and are made in an image form or other document file 
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form. 

The present invention employs the text embedding method containing the 
information as a method for authenticating the certificate based on the above features of 
the digital certificate. Such text embedding technique and method are directed to 
- 5 protection and authentication of the digital certificate and is a new technique discriminated 
from the existing method of authenticating a certificate. In particular, the text embedding 
method used in the present invention is a method of embedding the bit-streamed text data 
and a method and apparatus including the detailed algorithm of an invention are explained 
referring to the drawings hereinbelow. 

10 For a reference, before explaining the content of the present invention, it is 

necessary to review images which are included in the certificate, which is to analyze types 
of the image which are included in the certificate and embed the information appropriate to 
said image features. The image data contained in the certificate includes a symbolic 
image representing an issuing place or a registered seal, seal image representing an issuing 

15 authority. An example of such image data is shown in Fig. 14. 

BRIEF DESCRIPTION OF THE DRAWINGS 

20 Fig. 1 is a flow chart illustrating conceptually the embedding process of the text 

embedding apparatus of the present invention. 

Fig. 2 is a flow chart illustrating conceptually the text extracting process of the 
present invention. 

Fig. 3 exemplifies types of various digital certificates and illustrates the feature 
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analysis of the digital certificate. 

Fig. 4 is a flow chart illustrating the process to the step of generating a text to be 
embedded according to each feature by classifying certificates based on the feature of the 
certificate. 

Fig. 5 illustrates the bit streaming process of the text to be embedded with regard 
to the certificate based on an English text document. 

Fig. 6 illustrates the bit streaming process of the text to be embedded in case a 
certificate is written as a language other than English based on the classification of Fig. 4. 

Fig. 7 illustrates the process of changing an image included in the digital 
certificate to the one dimensional data or, in case an image is not included in the certificate, 
changing the generated transparent image to the one dimensional data. 

Fig. 8 illustrates the process of generating a transparent image. 

Fig. 9A illustrates the process of embedding a text data. 

Fig. 9B illustrates an example of the process of embedding a text data. 

Fig. 10 illustrates the process of extracting a text data. 

Fig. 11 illustrates the result of comparing a grayscale image before embedding a 
text with a grayscale image after embedding a text along with the histograms. 

Fig. 12 illustrates the result of comparing a color image before embedding a text 
with a color image after embedding a text along with the histograms. 

Fig. 13 illustrates the result of comparing a registered seal image before 
embedding a text with a registered seal image after embedding a text along with the 
histograms. 

Fig. 14 illustrates examples of an image data included in the certificate. 
Fig. 1 5 illustrates an example of the certificate including a bar code image. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Fig. 1 is a comprehensive flow chart of the step of embedding a text into the 
certificate for authenticating the certificate (10) regarding the general flow of the present 
5 invention. In the step of classifying the certificate among the steps of Fig. 1, the 
certificate is classified based on two preconditions of whether an image-formed data exists 
in the certificate (17) and the certificate is written as English (11). . The specific process of 
classifying the certificate is explained in Fig. 4. In the step (14) of generating the text to 
be embedded,, after proceeding the step of classifying the certificate by a language, in case 

10 the certificate is on the basis of English, the step (13) of selecting a method of embedding a 
text is undergone and in case the certificate is on the basis of other language, the step (12) 
of selecting a method of embedding a text by the corresponding language is undergone. 
In case the certificate is on the basis of English, the step of selecting a method is in detail 
explained in Fig. 5. In case the certificate is on the basis of other language, the process of 

15 selecting a language is in detail explained in Fig. 6. The generated text is bit streamed as 
a unit of 6, 7, 8, and 16 bits through the step (15) of bit-streaming. In the step of 
classifying the certificate, in case image-formed data exists in the certificate, the step (17) 
of directly converting the image data to the one dimensional data is undergone. In case 
image-formed data does not exist in the certificate, the step (18) of generating a transparent 

20 image is undergone. The step of generating a transparent image is in detail explained in 
Fig, 8. The generated transparent image also undergoes the step (19) of converting a 
transparent image data to the one dimensional data (20). RSI method, which is applied to 
the step of converting an image to the one dimensional data, will be in detail explained in 
Fig. 7. The step (21) of embedding the bit-streamed data in the one dimensional data is 
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proceeded by Equation 4. A key is employed for a secure protection of information in the 
embedding step. 

Fig. 2 is a comprehensive flow chart of the step of extracting information text 
which is a step for authenticating the certificate of the present invention. The step of 
5 extracting a text, which is a step of extracting the embedded text, comprises the step (32) 
of selecting the text embedded image, step (33) of extracting a bit stream, step of 
constituting a text from the extracted bit stream based on the language of the certificate. 
The step of selecting an image selects a text embedded image. The embedded image also 
includes a transparent image. The bit stream data is extracted by applying a key value 

10 employed in first embedding a text from the selected image. Likewise the embedding 
process, the bit stream data is extracted from the pixel value information of an image by 
Equation 4. After detennining whether the extracted bit stream data is an English 
certificate (34) or certificate of other language (38), the text data is generated (42) through 
a reverse process of selection methods (35-37, 39-41) selected in the step of embedding a 

15 text (Figs. 5 and 6). 

Fig. 3 is to analyze the feature of the digital certificate. The digital certificate 50, 
51 has a specified format, as shown in Fig. 3, and frequently includes image-formed data 
52. In the present invention, all cases of a certificate itself is an image, a certificate does 
not include image-formed data, a certificate does not include image formed data are 

20 possible. Because in case a certificate is a image-formed file, the information text is 
embedded into the certificate image itself, in case an image data exists, the information text 
is embedded into the image data, and in case an image data does not exist, a transparent 
image is generated and the information text is embedded into the image and then included 
in the certificate. 
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Fig. 4 is a drawing of the process of classifying the digital certificate. The 
classification is based on lands of language used in the certificate and pattern of the 
certificate. The certificate is classified on the following two conditions in the step of 
classifying the certificate. 
5 One is to determine (62) whether an image-formed data is included in the digital 

certificate (61) and another is to determine (63, 66) whether the certificate is written in 
English or other language. The above two conditions enable to classify the certificate. 
If an image-formed file exists, it is determined (63) whether the digital certificate is written 
in English or other language. It is possible to determine whether "written in English?" 

10 based on ASCII code employed and the image separated from the text exists in the form of 
an object (in case of HTML, it is in the form of a link) in the document Pursuit of an 
object makes an automatic determination possible. In case the entire certificate is an 
image, a text included in the existing certificate is extracted through a character recognition 
system like a neural network net and automatically classified based on the determination of 

15 ASCII code employed. In case the entire certificate is an image, "image included?" 
automatically determines that an image is included. If in English, a text document to be 
embedded is written based on the embedding text generating portion 1 (a) (64) in Fig. 4. 
If not in English, a text document to be embedded is generated based on the embedding 
• text generating portion 2(a) (65) in Fig. 4. In case an image-formed data does not exist, a 

20 step of preparing a transparent image is further added as compared to the case an image- 
formed data exists. The process of generating a transparent image will be explained in 
Fig. 8. After preparing a transparent image, it is determined whether the certificate is 
written in English or other language. If the certificate is written in English, a text 
document to be embedded is prepared based on the embedding text generating portion 1(b) 
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(68) in Fig. 4. If not in English, a text document to be embedded is generated by the 
embedding text generating portion 2(b) (67) in Fig. 4. It is very important to determine 
whether it is in English or not. The present invention also provides a method of 
authenticating the digital certificate in Korean, Japanese, and Chinese other than English. 
And the above method is sure to be applied to the certificate in any other languages. 

Fig. 5 shows a bit-streaming process of a text to be embedded regarding the 
English text document-based certificate. In the step of generating a text to be embedded 
through a process of classifying the certificate in Fig. 4, a text generated in the embedding 
text generating portion 1(a) (70) and the embedding text generating portion 1(b) (71) is 
English text data. (In case of an English certificate, an issuing authority mostly seems to 
use English as a main language and thus supposes that a text to be embedded is prepared in 
English.) An information text to be embedded in the English certificate, i.e., 8 bit ASCII 
code-based text is bit-streamed through the following process: First, there are basically 
three methods (72-74) in bit streaming and the classification of bit streaming methods is 
based on table of ASCII code mainly referred to. ASCII code is shown in Table 1. 
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Method 1 in Fig. 5 is to bi.-s.ream (75) a .ex. document as 7 bits based or, Me 1 . 
hr case of making binary a ,ex, docnmen, as 7 WU, 100 Bngiish ebnracto ennsuhne a bi. 
stream of 700 bits. 

An English code is usually proceaacd aa a oni. of] byte. Th. remaining one bi. 
differs lime by lidle according to its use. Metood 2 adds a parity bi, in the form of the 
remaining even nomber or odd nnmber to 7 bits of Tabic 1 and consStotoa a , byte-formed 
code, thereby bit-streaming (76). When bit-sheaming a tea, da«a to be embedded in the 
ceruncato, addition of a pan* bi, in me even ntunber or odd number form to 7 bits shows 
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embedding and when embedding, ASCII even code Table or ASCII odd code Table is 
referred to. Method 2 constitutes a bit stream of 800 bits per 100 English characters. (See 
International Standard ASCII even code Table and ASCII odd code Table) 

Method 3, considering the feature that a part of codes in Table 1 are not used, 
reduces number of codes being used as shown in Table 2 to express 6 bits and bit-stream 
(77) as a unit of 6 bits. Method 3 constitutes a bit stream of 600 bits per 100 English 
characters. 
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Hence, in case of the same bit stream, Method 3 can embed more text documents 
than other methods can do, while not embedding the same text as text constituting the 
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certificate. However, Methods 1 and 2 can embed the same document as text of the 
certificate. In other words, reduction in codes of Method 3 makes it impossible to express 
the information showing a character shape of the text constituting the certificate. It 
means it is impossible to express a style of character as it is. 
5 Fig. 6 illustrates a process of bit-streaming a text to be embedded based on the 

classification of Fig. 4. A process of bit streaming a text based on Korean, Chinese, and 
Japanese other than English is explained herein. The text data (82) to be embedded which 
is generated in the embedding text generating portion 2(a) (80) and the embedding text 
generating portion 2(b) (81) in Fig. 4 is not prepared in English and thus 2 byte code based 

10 on other Unicode. Therefore, a type of language of the certificate is classified to bit- 
stream the text data to be embedded in the certificate. 

The certificates prepared in language other than English, i.e., those based on 2 
byte code of Korean, Chinese, and Japanese, etc. are all bit-streamed (83) as a unit of 16 
bits. Regarding the Universal Code System distributed in May, 1993, ISO 10646- 

15 1/Unicode LI, since Unicode and ISO 10646 were integrated, technically speaking, ISO 
10646 is "almost" similar to Unicode version LI. UCS-2 of ISO 10646-1 and Unicode 
are united into one. At present Unicode 2.0, a revised edition of ISO 10646, has solved a 
partial problem existing in the version 1.1. ISO 10646/unicode is now commonly used. 
The Universal Code System supports Korean, Chinese, and Japanese, etc. in the form of a 

20 16 bit code. Based on such International Standard, the present invention, when bit 
streaming the digital certificate, bit-streamed the digital certificate in Korean, Chinese, and 
Japanese, etc. as a unit of 16 bits. (See International Standard ISO 10646-1 UCS 2 & ISO 
10646 Unicode 2.0 / 3.0 Code Table) 

Hence, when bit-streaming a certificate in English and a certificate in language- 
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other than English, the length of the bit stream is differentiated. In other words, in case of 
an English certificate, amount of the text capable of being embedded in the certificate is 
twice as much as that of the Unicode-based certificate in other language other than English. 

Fig. 7 illustrates a process of converting an image included in the digital 
certificate or, in case an image is not included in the certificate, the generated transparent 
image into one-dimensional data. There are various methods in converting an image into 
one-dimensional data. Raster Scan Image method among commonly used methods is 
applied to the present invention. Hilbert Scan Image method, Z Scan Image method, etc. 
also can be applied. 

Raster Scan Image converts two-dimensional data into one-dimensional data, 
which is most simple and widely used method. Starting at the point of left above of an 
image, a lower line (next row) pixel value of the image, from the highest line (row) in turn 
is re-arranged in a one-dimensional manner and thus one-dimensional image signal is 
prepared. In case an image extracted from the certificate is a black-and-white color 
image 91, two-dimensional data is converted into one-dimensional data through a process 
of Raster Scan Image. In case an image extracted from the certificate is a color image 90, 
the image is decomposed to red, green, and blue components and is converted into one- 
dimensional data through a process of Raster Scan Image and then combined in order of 
red, green, and blue components to be converted into one-dimensional data. 

Fig. 8 . illustrates a process of generating a transparent image. A transparent 
image is manufactured in case where the background condition of the digital certificate to 
be issued is white. In case where the background condition "is not white, a transparent 
image is generated based on the color information of background condition. The process 
of generating a transparent image is as follows: A transparent image generated for 
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indicating accurate information of bit stream must be quantized to at least 2 bits. When 
the image is quantized to 2 bits, it comprises four pixel value (100). Each pixel value is 
given as 0, 1, 2, 3 and 0 corresponds to pixel value RGB=[255,255,255] (101) and 1 
corresponds to pixel value RGB=[255,255,254] (102). A transparent image is generated 
using said 0 and 1 . In selecting 0 and 1, there are methods of all selecting 0, all selecting 
1, and selecting 0 and 1 at random (103). Size of a transparent image to be generated is 
determined (104) by size of text to be embedded. Number of pixels of a transparent 
image is at least bigger than length of a bit stream to be embedded in order to embed 
sufficient amount of text for authentication. And since the image is two-dimensional data, 
the size should be as large as multiplication of two natural numbers. 

Fig. 9A illustrates a process of embedding text data. Step of embedding the bit- 
streamed data is as follows: First, in case an image included in the certificate is a grayscale, 
the maximum value Max(Pixel) of pixel value of image is changed to Max(Pixel)-l (120). 
In case an image extracted from the certificate is a color, the image is decomposed (116) by 
its color and the maximum value MAX(Pixel) of pixel value of red, green, and blue 
components is changed to Max(Pixel)-l (117). This is such that the maximum value does 
not go beyond the range of 0~2 m -l since pixel value changes in embedding an information 
text In case the certificate does not include an image, a newly generated transparent 
image is employed as it is. Like in Fig. 5 or Fig. 6, text (110, 111) to be embedded is bit- 
streamed (112, 113) and like in Fig. 7, image data is given as one-dimensional data (118). 
The bit-streamed text data is indicated as B(i)(l 14). As a result of converting said image 
to one-dimensional data, the data is indicated as image(i) and the definition scope of one- 
dimensional data is classified as a color image and grayscale image, which are respectively 
indicated in Equation 1 and Equation 2. N x M shows size of width and length of an 
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image. 



XT') 



Equation 3 is a range of value of one-dimensional data. Herein m indicates a 
quantized bit number of the one-dimensional data. When the bit streamed data is 
embedded in said image data which is converted to one-dimensional data, Key m indicates 
key value designating the first embedding, location and imagefKey^ + 1) indicates a 
10 location virtually embedded in one-dimensional data. 



The principle of embedding (122) is as follows: 
15 In case comparing the bit-streamed value with the embedding location value 

satisfies Equation 4, pixel value of an image, Equation 5, is not changed. If that fails to 
satisfy Equation 4, pixel value of one-dimensional image is changed by image(Key^ + i) 
+1.. 



2Q image(key^^i)^B(i) mod 2 



imag4far m +Q (5) 
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For a specific example, as shown in Fig. 9B, a process of embedding the text 
"four-life" (126) in 6 X 6 . pixel image (125), based on Method 3, is as follows -(See Table 
2): First, 6X6 image is changed to one-dimensional data Image® (127) by applying RSI 
method. Table 2 shows that and V* correspond, to "5" and "14", respectively. 
Hence, a decimal number row "5, 14, 20, 17, 49, 11, 8, 5, 4" are obtained referring to 
"four-life" based on the index of Table 2. The above row is changed to a bit row in a unit 
of 6 bits and thus obtaining a. bit-streamed data B(i). Since image(l)=123 and B(1)=0 
based on Equation 4, pixel value is changed to 124 by applying 123+1. So to speak, since 
the first pixel value of one-dimensional image is 123, if 123 is divided by 2, the remnant is 
1 . Since the remnant 1 is not identical to the first bit value 0 of the bit stream, it makes 
124 by adding 1 to the original value of the first pixel. Meanwhile, since when the 
second pixel Image(2)=124 is divided by 2, the remnant is identical to 0, the second bit 
value of a bit stream, the pixel value is not changed. The bit stream information of all 
B(i) is embedded in 6 X 6 image in the same way. The image is constituted (124) by 
applying an inverse RSI method (123) after embedding. Such constituted image is an 
image (1 29) in which a text is embedded. 

A key is designated by a decimal number of 1-8 figures arbitrarily decided by a 
certificate holder or certificate issuer. Naturally the key is not used as it is. The range 
(D in the following Equation 6) designated by the key value through a mathematical 
calculation by the Mowing Equations 6, 7, and 8 is prepared. A certificate issuer does 
not care about the process of being changed to the range designated by the key value. 
That is ail that he memorizes information of the first designated key. The key employed 
in the present invention are under two restrictions as described below. 
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The first restriction is as follows: 

A key is in a close contact with the number of pixels of the image extracted from 
the certificate, i.e., the size of image, and is affected by the size of image. The size of 
image is the range of bit stream generated in the text capable of being virtually embedded. 
5 Herein the maximum value of the definition i of image(i) is Max(image,i). 

The second restriction is as follows: 

The length of bit stream to be embedded is decided according to how much the 
maximum value of / in the B(i) is. Herein the maximum value of B(i) is Max (BJ). The 
range of the value of key Key^ is D. The value of D has the following relation with the 
10 maximum value Max(BJJ and the maximum value Max(image,i) of the bit stream obtained 
from the size of image and capable of being embedded. 



= {Vkey^O^keysn <Max{image 9 i)-Max{BJ)} ■ 

(6) 



15 If the inputted value of Key^ is bigger than the value indicated in Equation 7, the 

key value is used after the same calculation as Equation 8. If the inputted value of Key^ 
is smaller than the value indicated in Equation 7, the key value is used as it is. If virtually 
used key value is Key m]9 the value is the same as Equation 8. 



2Q Maxiimagerf-MaxiB.i) (7) 

^yi 3 Keyset mod Max(image,i) - Max(B, i) 



The key is used for the following purposes. 
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First, the key is used for designating the location of embedding data of the bit 
streamed text document into one dimensional image data. 

Second, the key is used for a certificate issuer and holder confirming that it is his 
own information. Such is to prevent the following attack. When a person forges a text 
5 document using the key value, it can prevent a document embedded in the image of the 
certificate using this algorithm from being confirmed. 

Third, since the embedding section is decided using the key value, maintenance of 
security of the key value ensures stability of the embedded text data itself. 

As shown in the embedding algorithm, an image is selected or generated based 
10 on the certificate and information text (of necessary information/content of entire 
certificate, etc.) to be embedded is generated based on the certificate. Text information 
prepared by the certificate is embedded into image selected in the certificate, a new image 
indistinguishable visually is prepared. Completion of embedding a text through such a 
. process enables to authenticate the certificate. 
15 Fig. 10 illustrates a process of extracting text data. The extraction algorithm is 

proceeded as follows. First, an image (including a transparent image) into which an 
information text is embedded is selected (131) from an image such as a registered image of 
an individual of issuer, an evidencing picture of a user, or a logo and image of an issuing 
authority which are included in the certificate (130). The selected image is converted 
20 (134) into one dimensional data using Raster Scan Image method (133). Next, a location 
where embedding begins is found by a key of a user or issuer and a bit stream B(i) (135) 
satisfying imageQceysN + V s B(i) mod 2 based on whether the pixel value starting from 
one dimensional data image(KEY SN + 1) is odd number or even number. Subsequently, in 
case of an English-based certificate, the embedding method 1, embedding method 2, and 
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embedding method 3 are selected to combine with a unit of 8 bits, 7 bits, and 6 bits, 
respectively, and thus the above constituted bit stream (136) is changed (137) to a decimal 
number. In case of not an English-based certificate, the above constituted bit stream is 
changed to a decimal number after combining with a unit of 16 bits. And, the constituted 
5 decimal number stream is converted (138) to text data based on Table referred to by the 
embedding method. The result obtained by conversion becomes text data (139) to be 
restored. 

As shown in Fig. 10, a process of restoring the embedded text is an inverse 
. process of the embedding process. 

10 A specific algorithm of the. present invention comprises three basic steps. They 

are steps of generating an information text to be embedded, embedding an information text, 
and extracting an information text. What directly influences the performance of 
algorithm is the change in an image in a process of embedding an information text and 
amount of information to be embedded in a text. An explanation on this matter in more 

15 detail is as follows. First change in an image is explained. 

It could be said that a process of embedding a text document is a process of 
virtually changing an image. In other words, an image is changed by embedding a text 
designated by a user into the image, which could be a process of embedding noise in an 

image. Size of the embedded noise divides the original image and text-embedded image. 

i, 

20 In the present invention, the certificate which is issued as a certificate of authentication and 
provided to a person who requests for an issue does not contain the image which, was 
included in the originally generated certificate. In other words, an image embedded in the 
issued certificate is an image into which an information text generated by an issuer is 
embedded. The original image is not necessary any longer after embedding an. 
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information text 

Fig. 11, Fig. 12, and Fig. 13 are results for comparing grayscale image 140, color 
image 150, and registered seal image 160 before a text is embedded with images 142, 152, 
162 after a text is embedded along with the histograms 141, 143, 151, 153, 161, 163. 
5 Since the change in the pixel value is little, it proves undistinguishable visually. Change 
in the pixel value is proceeded only in the embedded region. 

In comparison with the histograms of Fig. 1 1 , it can be seen that Red component 
into which text data are much embedded is changed. Fig. 12 and Fig. 13 show an 
evaluation of performance of a color image and registered seal image. As a result of 
10 embedding a text in a grayscale image, color image and registered image and comparing 
the change of the embedded image with that of the original image, it can be seen that it is 
impossible to confirm the change in the pixel value visually. 

Next, amount of information to be embedded is reviewed. 

Amount of information embedded as a series of texts is in a direct relation with 
15 the size of image included in the certificate and the level of quantization of the image. 
Table as shown below is a result of calculating amount of embedding a text according to 
the bit number of quantization and size of the image. As seen in the amount of the 
embedded information, in case the certificate includes an image, the content is hardly 
restricted. Usually the certificate contains the text content of about 500-3000 characters. 
20 Hence, size of the information text to be embedded is hardly restricted. 
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[Table 3] 



Image Size 


Bit Number of Quantization 


Information Amount Capable of being 
Embedded 


256 by 256 


Color (24 bits) 


Max 32,768 characters (unit of 6 bits) 


256 by 256 


Grayscale (8 bits) 


Max 10,922 characters (unit of 6 bits) 


153 by 134 


Color (24 bits) 


Max 10,251 characters (unit of 6 bits) 


153 by 134 


Grayscale (8 bits) 


Max 3,417 characters (unit of 6 bits) 



If so, when a third party launches various attacks (modification, etc.) and makes a 
processing on the image which performed the method suggested in tHe present invention, it 
5 is explained what effect is produced. 

There are many kinds in image processing. The present invention has made a . 
variety of image processing regarding the text embedded image experimentally. In case 
the compression of the text embedded image proceeds in the same frequency , space as 
JPEG, the embedded text disappeared. In case of applying a similar method of filter, the 
10 embedded text can not be restored. For a different image processing method, Histogram 
equalization, Image sharpening, Image cropping are applied. As a result, the embedded 
text document can not be still restored. Almost image processing method can not extract 
the embedded text except for a geometric image processing including rotation. 

Table 4 as shown below is an analysis result of the image processing. (X - 
15 Extraction Impossible, O - Extraction Possible) 
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[Table 4] 



\ Image 










Color 


Grayscale 


Registered 




Image 


Image 


Seal Image 


Processing Method\ 








JPEG Compression 


X 


X . 


X 


Image Cropping 


X 


X 


X 


Equalization 


X 


X 


X 


Sharpening 


X 


X 


X . 


Smoothing 

(Feature Extraction, 
Differential) Filtering 


X 


X 


X 


X 


X 


X 


Geometric Rotation 
Change . 


0 


o 


0 



That is, the method suggested in the present invention has property of a Fragile 
Watermark indispensable for the purpose of "authentication". (Refer to a patent filed in the 
5 name of the present applicant regarding Fragile Watermark) 

Alternation of the certificate includes alternation of text (name or data, etc.) which 
is the content of the certificate and alternation of image which is included in the certificate. 
In the event of altering the text content included in the certificate, extraction of the 
certificate content information text embedded into an image makes it possible to find the 
10 alternation and restore me original text document. According to the above result, since all 
the text content included in the certificate can be embedded into the image of the certificate, 
if extraction of the embedded text can restore all the content of the original certificate, it 
can be confirmed whether it is forged or altered. In the event of altering the text 
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embedded image which is included in the certificate, the embedded text will not be 
extracted. When it is impossible to extract the text which is embedded in the above 
image, the certificate is regarded as being invalid. 

Forgery of the certificate is to change a part of the certificate in the manner as one 

5 wishes. In the event of forging the certificate, it is necessary to find out the key used by a 
certificate issuer or user. If not so, it is possible to embed a text into the above image but 
its authentication is impossible. Thus, the forgery can be detected. 

As seen in Fig. 15, it is possible to use a combination of bar code and text 
embedding method described above according to other embodiment of the present 

10 invention. This is a method of adding bar code to the image which is embedded into the 
certificate. More specifically, it is a method of adding bar code 171 to the image type 
which is included in the certificate at present and applying the present invention. This 
type of method is divided into two cases: one image is applied by combining bar code and 
other image; and bar code and image into which a text is embedded are separately applied. 

15 In case of applying one image by means of a combination, the image part 

overlapping bar code is made a block processing and thus not embedding a text, making 
possible to. maintain the stability of bar code and text. In case of a separate use, bar code 
makes authentication of the certificate easy and fast. Also, advantageously more 
performance of a process of the present invention to extract text which is embedded into 

20 the image is a double authentication, making it possible to intensify security and stability. 
In particular, in case of using bar code, when a person who tries to forge or alter the 
certificate simply sees bar code and attacks the part, the part of image 170 into which a text 
is embedded invisibly may be neglected, which has an effect of a double security. 

Although it is not illustrated, the text embedding and extraction described 
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above are performed using a computer. A usual programming language reads the digital 
certificate for its object, i.e., an electronic document, and distinguishes text and image and 
prepares a program wherein a text is embedded into image and then stores it in a recording 
medium, thereby making it performed in a computer. A person having ordinary skill in 
5 the art can easily develop a variety of programs realizing the present invention. In other 
words, as widely known, the program-related data can be temporarily stored in a hard disc 
of computer from an exterior memory medium such as a floppy disc and can be stored in 
RAM from disc when the above text embedding program is operated. A memory 
medium for providing a program code, e.g., floppy disc, hard disc, optical disc, optic- 
10 magnetic disc, CD-ROM, CR-R, magnetic tape, nonvolatile memory card, ROM, etc. are 
employed. When the present invention is applied to aforementioned memory medium, 
program codes corresponding to aforementioned embedding steps are stored in memory 
medium. 

15 

INDUSTRIAL APPLICABILITY 

The present invention is sure to authenticate various types of digital certificate. 
Prevention of forgery or alternation of the certificate has an effect of avoiding a variety of 
20 damage caused by forgery or alternation. The present invention can distinguish the 
certificate from being forged/altered and restore the forged/altered content when 
forged/altered. Furthermore, the present invention can safely deliver the signature of a 
certificate issuer to the receiver and confirm the accurate identity of the other party by 
extracting the content of a text. 
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What is claimed is: 

1 . A method for embedding text into electronic document, comprising : 

converting at least a part of said text to bit stream; 
5 converting at least a part of image data to one dimensional data in case where said 

electronic document contains said image data, and generating transparent image data to 
convert the generated transparent image data to one dimensional data in case where said 
electronic document does not contain said image data; 

embedding the text which was converted to bit stream into said one dimensional 

10 data; 

restoring said one dimensional data embedded with said text to image data of said 
electronic document; and 

embedding the restored image data into said electronic document 

15 2. The method of claim 1, wherein said text is the information which is used for 
authenticating said electronic document. 

3. The method of claim 1, wherein said transparent image is at least a part of background 
of said electronic document. 

20 

4. The method of claim 1, wherein said text image is at least a part of said electronic 
document selected from said electronic document. 

5. The method of claim 4, further comprising : 
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distinguishing the kinds of language of said text; and 

converting said text which will be embedded to said bit stream corresponding to 
each language. 

6. The method of claim 1 , wherein said embedding further comprises : 

calculating the remnant of each unit data of said one dimensional data using a 
remnant operator; and 

maintaining said unit data in case where Ihe remnant is equal to one bit value of 
said bit stream, and adding 1 to said unit data in case where the remnant is not equal to one 
bit value of said bit stream, said remnant being compared with said one bit value of said bit 
stream. 

7. The method of claim 1, wherein at least a part of said text is converted to said bit 
stream after block encoding it or calculating message digest value of it is performed. 

8. The method of claim 1, wherein said converting uses one technique among the Raster 
Scan Image, Hilbert Scan Image, and Z Scan Image. 

9. The method of claim 1 , wherein at least a part of said image data includes barcode. 

» 

10. The method for extracting text from the electronic document into which the text was 
embedded by any one of claim 1 to claim 9, comprising : 

selecting the text embedded image data from said electronic document; 
converting at least a part of the selected image data to one dimensional data; 
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generating said series of bit stream in accordance with said one dimensional data; 

.and 

• extracting the embedded text by converting said series of bit stream to text. 

11. The method of claim 10, wherein said generating said series of bit stream in 
accordance with said one dimensional data comprises : 

calculating the remnant by dividing each unit data of said first order by 2; and 
generating said series of bit stream from said remnant. 

12. A computer-readable recording medium for storing computer program * for 
authenticating electronic document, the program allowing a computer to execute the steps 
of: 

converting at least a part of text to a bit stream; 

converting at least a part of image data to one dimensional data in case where said 
electronic document contains image data, and generating transparent image data to convert 
said transparent image data to one . dimensional data in case where said electronic 
document does not contain image data; 

embedding said text which was converted to said bit stream into said one 
dimensional data; . 

restoring said one dimensional data into which the is embedded to image data of 
said electronic document; and 

inserting the restored image data into. said electronic document. 
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Fig. 10 
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